Approximate Query Answering by Model Averaging
Abstract
In earlier work we introduced and explored a variety of probabilistic models for answering selectivity queries posed to large sparse binary data sets. These models scale directly to hundreds or thousands of dimensions, in contrast to other approximate querying techniques (such as histograms or wavelets) that are inherently limited to relatively small numbers of dimensions. In this paper, we extend this work by applying probabilistic model averaging to query answering, a scheme that allows the query-answering algorithm to adapt automatically and optimally to both the specific nature of the data and the distribution of queries issued by a specific user. We demonstrate on real-world and simulated data sets that model averaging can reduce prediction error, relative to any single model, by up to 50%. Learning the combining weights is a straightforward and scalable optimization problem that can be easily automated, providing a practical framework for approximate query answering with massive data sets.
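As a rough illustration of what learning the combining weights might look like, the sketch below fits non-negative weights that sum to one by minimizing squared error on a set of training queries. The abstract does not state the objective or optimizer the authors use, so the squared-error loss, the projected-gradient solver, the synthetic data, and all names (`project_to_simplex`, `learn_weights`) are assumptions made for illustration only.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u)
    idx = np.arange(1, len(v) + 1)
    rho = np.nonzero(u * idx > (css - 1))[0][-1]
    theta = (css[rho] - 1) / (rho + 1)
    return np.maximum(v - theta, 0.0)

def learn_weights(preds, truth, steps=2000):
    """Hypothetical helper: fit convex combining weights.

    preds: (n_queries, n_models) selectivity estimates, one column per model.
    truth: (n_queries,) true selectivities for the training queries.
    """
    n, k = preds.shape
    w = np.full(k, 1.0 / k)  # start from the uniform average
    # Step size 1/L, where L is the Lipschitz constant of the gradient.
    lr = 1.0 / (np.linalg.norm(preds, 2) ** 2 / n + 1e-12)
    for _ in range(steps):
        grad = preds.T @ (preds @ w - truth) / n  # gradient of the mean squared error / 2
        w = project_to_simplex(w - lr * grad)
    return w

# Synthetic stand-in for three models answering 500 training queries (assumed data).
rng = np.random.default_rng(0)
truth = rng.uniform(0.0, 0.1, size=500)
preds = np.column_stack([
    truth + rng.normal(0, 0.01, 500),
    truth + rng.normal(0, 0.01, 500),
    truth + rng.normal(0, 0.05, 500),
])
weights = learn_weights(preds, truth)
single_mse = ((preds - truth[:, None]) ** 2).mean(axis=0)
averaged_mse = ((preds @ weights - truth) ** 2).mean()
print(weights, single_mse, averaged_mse)
```

Because the training queries can be drawn from a particular user's query log, weights fitted this way would reflect both the data and that user's query distribution, which is the kind of adaptation the abstract describes.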
Related resources
Cooperative Query Answering for Approximate Answers with Nearness Measure in Hierarchical Structure Information Systems
Thanit Puthpongsiriporn, Ph.D., University of Pittsburgh. Cooperative query answering for approximate answers has been utilized in various problem domains. Many challenges in manufacturing information retrieval, such as classifying parts into families in group technology implem...
CoXML: A Cooperative XML Query Answering System
The heterogeneous nature of XML data creates the need for approximate query answering. In this paper, we present an XML system that cooperates with users to provide user-specific approximate query answering. The key features of the system include: 1) a query language that allows users to specify approximate conditions and relaxation controls; 2) a relaxation index structure, XTAH, that enables ...
Providing Approximate Answers Using a Knowledge Abstraction Database
As database users adopt a query language to obtain information from a database, a more intelligent query answering system is increasingly needed. Relational databases are exact in nature, but effectiveness of decision support would improve significantly if the query answering system returns approximate answers rather than a null information response when there is no matching data available. Thi...
Cooperative Query Processing via Knowledge Abstraction and Query Relaxation
As database users adopt a query language to obtain information from a database, a more intelligent query answering system is increasingly needed that cooperates with the users to provide informative responses by understanding the intent behind a query. The effectiveness of decision support would improve significantly if the query answering system returned approximate answers rather than a null ...
The Polynomial Method Strikes Back: Tight Quantum Query Bounds via Dual Polynomials
The approximate degree of a Boolean function f is the least degree of a real polynomial that approximates f pointwise to error at most 1/3. The approximate degree of f is known to be a lower bound on the quantum query complexity of f (Beals et al., FOCS 1998 and J. ACM 2001). We resolve or nearly resolve the approximate degree and quantum query complexities of several basic functions. Specifica...
Publication date: 2003